Learning Shuffle Ideals Under Restricted Distributions
نویسنده
چکیده
The class of shuffle ideals is a fundamental sub-family of regular languages. The shuffle ideal generated by a string set U is the collection of all strings containing some string u ∈ U as a (not necessarily contiguous) subsequence. In spite of its apparent simplicity, the problem of learning a shuffle ideal from given data is known to be computationally intractable. In this paper, we study the PAC learnability of shuffle ideals and present positive results on this learning problem under element-wise independent and identical distributions and Markovian distributions in the statistical query model. A constrained generalization to learning shuffle ideals under product distributions is also provided. In the empirical direction, we propose a heuristic algorithm for learning shuffle ideals from given labeled strings under general unrestricted distributions. Experiments demonstrate the advantage for both efficiency and accuracy of our algorithm.
منابع مشابه
On the Learnability of Shuffle Ideals
PAC learning of unrestricted regular languages is long known to be a difficult problem. The class of shuffle ideals is a very restricted subclass of regular languages, where the shuffle ideal generated by a string u is the collection of all strings containing u as a subsequence. This fundamental language family is of theoretical interest in its own right and provides the building blocks for oth...
متن کاملOn Shuffe Ideals of General Algebras
We extend a word language concept called shuffle ideal to general algebras. For this purpose, we introduce the relation SH and show that there exists a natural connection between this relation and the homeomorphic embedding order on trees. We establish connections between shuffle ideals, monotonically ordered algebras and automata, and piecewise testable tree languages.
متن کاملShuffle-compatible permutation statistics II: the exterior peak set
5. Dendriform structures 59 5.1. Four operations on QSym . . . . . . . . . . . . . . . . . . . . . . . . 59 5.2. Ideals . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 61 5.3. Application to KEpk . . . . . . . . . . . . . . . . . . . . . . . . . . . . 63 5.4. Dendriform shuffle-compatibility . . . . . . . . . . . . . . . . . . . . 67 5.5. Criteria for Kst to be a stack i...
متن کاملShuffle Product Formulas of Multiple Zeta Values
Using the combinatorial description of shuffle product, we prove or reformulate several shuffle product formulas of multiple zeta values, including a general formula of the shuffle product of two multiple zeta values, some restricted shuffle product formulas of the product of two multiple zeta values, and a restricted shuffle product formula of the product of n multiple zeta values.
متن کاملKernel Density Estimation on Spaces of Gaussian Distributions and Symmetric Positive Definite Matrices
This paper analyses the kernel density estimation on spaces of Gaussian distributions endowed with different metrics. Explicit expressions of kernels are provided for the case of the 2-Wasserstein metric on multivariate Gaussian distributions and for the Fisher metric on multivariate centred distributions. Under the Fisher metric, the space of multivariate centred Gaussian distributions is isom...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2014